CS224N Final Project Abstract Meta-Concept Features for Text-Illustration
نویسندگان
چکیده
Cross language-image retrieval is a problem of high interest that is at the frontier between computer vision and natural language processing. State-of-the-art methods learn a common space with regard to some constraints of correlation or similarity from two textual and visual modalities that are processed in parallel and possibly jointly. This paper proposes a different approach that considers the cross-modal problem as a supervised mapping of visual modalities to textual ones. Each modality is thus seen as a particular projection of an abstract meta-concept. In practice, this space is learned through an asymmetric process, where the textual modality is used to generate a multi-label representation, further used to map the visual modality through a simple multi-layer perceptron. While being quite easy to implement, the experiments show that our approach significantly outperforms the state-of-the-art on FlickR-8K and FlickR-30K datasets for the text-illustration task.
منابع مشابه
CS224N Final Project Sentiment Analysis of Tweets: Baselines and Neural Network Models
The goal of sentiment analysis is to classify text samples according to their overall positivity or negativity. We refer to the positivity or negativity of a text sample as its polarity. In this project, we investigate three-class sentiment classification of Twitter data where the labels are “positive”, “negative”, and “neutral”. We explore a number of questions in relation to the sentiment ana...
متن کاملCS224n Final Project
I introduce a novel method for disambiguating word senses using a semisupervised approach. I contrast this method with the current state-of-the-art approaches and show that my approach performs well and could potentially lead to fully unsupervised approaches with high accuracy.1
متن کاملCS224N: Investigating SMS Text Normalization using Statistical Machine Translation
In this project we explore two approaches to SMS text normalization. First we try a dictionary substitution approach used by most websites that provide such a service, and then modify it with our extension. This is followed by a statistical machine translation (MT) approach using off the shelf MT tools. We evaluate the performance of our system on three test sets from different sources and disc...
متن کاملImage classification using multimedia knowledge networks
This paper presents novel methods for classifying images based on knowledge discovered from annotated images using WordNet. The novelty of this work is the automatic class discovery and the classifier combination using the extracted knowledge. The extracted knowledge is a network of concepts (e.g., image clusters and word-senses) with associated image and text examples. Concepts that are simila...
متن کاملImage Annotation and Two Paths to Text Illustration
This paper describes our participation to the ImageCLEF 2016 scalable concept image annotation main task and Text Illustration teaser. Regarding image annotation, we focused on better localizing the detected features. For this, we identified the saliency of the image to collect a list of potential interesting places into the image. We also added a specific human attribute detector that boosted ...
متن کامل